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SUMMARY 

The  purpose  of  tnls  paper  Ic  to  present  an  expository  account 
of  the  fundamental  Ideas  of  the  theory  of  games,  to,./ether  with  a 
discussion  of  some  of  tne  ur. resolved  asfectp  of  the  theory. 
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TliE  THEORY  OP  GAMES 

By 

Richard  Bellman 


§1 .  Introduction . 

In  recent  yearo,  a  new  branch  of  mathematical  analysis  has 
been  developec'  and  Intens  Ively  studied.  It  poBseases  a  great 
deal  of  Intrinsic  Interest  and  a  number  of  close  ties  with  other 
parts  of  mathematics  and  various  cognate  flelco  such  aa  mathema¬ 
tical  economics  and  mathematical  statistics.  In  this  article  we 
propose  to  give  a  brief  sketch  of  some  of  the  baric  Ideas  of 
what  la  now  called  the  Theory  of  Camea. 

Although  a  systematic  foundation  of  the  theory  was  begun  by 
Borel,  [b],  Independently  von  Neumann  [2?j  presented  his  own  formu¬ 
lation,  and  derived  the  basic  result  which  Is  the  cornerstone  of 
the  theory.  This  result,  whose  proof  had  eluded  Borel,  Is  called 
tne  Min-Max  theorem,  and  will  be  discussed  extensively  below. 

One  of  the  attractive  features  of  the  theory  of  games,  snared 
by  some  other  fields  of  mathematics  such  as  number  theory  and 
topology,  Is  tnat  It  Is  not  possible  to  gauge  the  deptn  of  problems 
wnlcn  can  be  formulated  In  ternp  of  quite  simple  ideas,  and  In 
very  few  words.  The  result  le  that  pome  fairly  simply  stated 
questions  lead  In  some  cases  to  quite  recondite  analysis,  and  In 
other  caees  to  tne  very  boundary  of  the  unknown. 

In  the  course  of  the  chapter  we  shall  refer  to  a  number  of 
books  and  researen  papers  which  explore  In  detail  a  number  of  t.he 
subjects  we  mention  quite  brief ly ,  and  which  contain  a  great  deal 
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more  bee  idea. 

The  most  Interettlng  and  entertaining  account  of  the  funda¬ 
mental  concepts,  valuable  for  amateur  and  professional  alike,  is 
that  contained  In  the  book  by  Williams,  [26]. 

§2 .  Matching  Colne . 

In  order  to  Illustrate  the  type  of  problems  encountered  In 
the  theory  of  games  and  the  concepts  used  to  analyze  these 
problems,  let  us  begin  our  discussion  t^lth  the  perennial  diver¬ 
sion  of  matching  coins. 

As  we  know,  the  game  proceeds  in  the  following  fashion.  At 
each  stage  of  the  game,  each  player  ciooees  to  show  a  head  or  a 
tall,  with  his  choice  unknown  to  the  ouher  placer.  It  Is  agreed 
upon  In  advance  that  one  player  ii^lns  if  the  coins  match,  and  tnat 
the  other  ^ layer  wins  If  they  do  not. 

Suppose  that  one  player  la  suddenly  Inspired  to  analyze  tne 
game  mathematically  in  the  hope  of  gaining  an  advantage  In  this 
way . 

How  does  one  anal:/ze  a  process  of  ttils  type  mathematically? 
This  question  1b  asked  deliberately,  and  not  rnetorlcally ,  to 
emphasize  first  of  all  tne  fact  tnat  before  a  .‘^atlsfan'-ory  methon 
of  analysis  Is  discovered.  It  Is  often  not  at  all  clear  how  to 
proceed,  and  further  to  emphasize  tne  related  fact  that  even  after 
one  good  metnod  hae  been  found.  It  may  rtlll  be  true  that  another 
still  better  method  Is  required  to  treat  questions  of  a  nlgner 
level  of  difficulty.  No  theory  tne;.  sh.ould  be  regarded  as  either 
Inevitable  In  Its  formulation,  or  final  In  Its  conclusions. 
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Returnlng  to  coin-matching,  It  Is  clear  that  If  we  reject 
extrasensory  perception  and  telekinesis  as  being  at  best  unproved 
and  undependable,  tnere  Is  no  way  In  which  either  player  can  gain 
an  advantage  playing  the  game  once  against  an  unknown  opponent. 

The  way  out  of  this  cul-de— sac,  which  would  seem  to  block  any 
mathematical  study,  Is  to  focus  our  attention  upon  games  which  are 
played  a  large  number  of  timer,  and  concentrate  upon  gaining  an 
advantage  In  the  long  run.  In  otner  words,  each  player  Is  to 
play  In  such  a  way  as  to  maximize  an  average  return. 

This,  of  course.  Is  one  of  the  guiding  principles  of  gaming, 
either  at  the  card  table,  on  t.he  football  field,  or  on  a  baaeball 
diamond.  It  sustains  a  bridge  master  when  he  makes  an  unfortunate 
finesse;  It  sustains  the  poker  expert  who  calls  a  straight  with 
aces  over  fourr;  It  sustains  the  baseball  manager  who  calls  for 
a  hit— and— run  which  ends  In  a  double  play.  The  playerr  who  dis¬ 
regard  the  averaging  procecs,  and  rely  instead  upon  hunches,  make 
a  number  of  spectacular  gains,  which  are  duly  advertised  by  the 
players  themselves,  and  sometimes  the  newspapers.  In  the  long 
run,  they  lose  consistently,  and  consistently  blame  their  misfor¬ 
tune  upon  bad  luck,  Inferior  partners,  poor  teamwork,  etc.,  etc., 
etc . 

Let  us  agree  t  len  that  the  at*alvtlc  player  decide^  to  play 
In  such  a  way  as  to  maximize  the  average  amount  won  at  any  parti¬ 
cular  play  of  the  game,  which  Is  equivalent  to  the  assertion 
that  he  Intends  to  play  the  game  a  large  number  of  times  and 
maximize  the  average  amount  won  In  the  course  of  tnls  large  number 
of  plays,  or  acts  as  If  he  Intended  to  follow  this  pattern.  We 
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shall  discuee  below  some  of  the  conpllcet Ions  connected  with  this 
point  of  view. 

C  nelder  the  situation  of  thle  player,  taken  to  be  the  one 
who  wins  If  the  coins  match.  He  can  argue  as  follows: 

"The  principle  of  Insufficient  reason  assures  me  that  the 
other  player  will  be  dloj)laylrig  a  head  or  a  tall  with  equal 
probability,  alnce  there  eeema  to  be  no  reeeon  why  he  should  show 
one  or  the  other.  Consequerit ly ,  no  matter  what  I  do,  I  have 
equal  probability  of  winning  or  losing.  I  mlgnt  just  as  well 
show  heads  all  the  time." 

Then  the  following  disturbing  thought  occurs: 

"Suppose  t.hat  the  other  player  Is  Irrational,  or  supi-ose 
that  I  meet  a  atrlng  of  coliH-riatcners  who  on  the  baslt  of  philoso¬ 
phical  principles,  or  as  a  result  of  election  betr,  nave  pledged 
themselves  to  sf'ow  tails  all  tne  time.  I  will  tr.et.  lose  on 
every  play.  How  can  I  guard  against  thlij  contingency?" 

A  small  amount  of  reflection  shows  that  the  flayer  can  guard 
against  situations  of  this  type  by  sr.owlng  heads  or  tails  with 
equal  probability  In  a  raaidom  fa.‘’hlon.  In  thle  way.  no  matter 
what  group  of  opponents  he  encounters,  on  the  average  his  gain 
or  lose  will  be  zero. 

This  Is  net  a  very  encouraging  result  as  far  as  Indicating 
the  utility  of  mathematical  unal^cls,  tut  let  us  persevere. 
Something  may  come  of  It. 

The  analysis  above  was  highly  plausible  and  completely  Intui¬ 
tive.  How  can  we  obtain  tneae  results  In  a  aystemallc  fashion 
which  puts  lees  strain  upon  our  Intelligence  and  more  upon  our 
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mathematlcs?  The  advantage  In  developing  a  systematic  approach 
lies  in  the  poselbillty  that  this  same  approach  may  possibly 
be  useful  in  connection  with  other  processes  of  less  trivial 
nature . 

Let  us  now  suppose  that  we  have  two  players  guided  by  the 
same  principle,  that  of  maximizing  average  return.  Let  the  first 
player  play  heads  with  probability  a  and  tells  with  probability 
1-a,  and  let  the  second  player  show  heads  with  probability  b, 
and  tails  with  probability  1-b. 

The  probability  th^t  the  first  player  wins  is  then 

(1)  ab  >  (l-a)(l-b), 

the  total  probability  that  the  two  coins  match,  .hlle  the  pro¬ 
bability  that  the  other  plviyer  wins  is  given  by 

(2)  a(l-b)  +  b(l-a  ) . 

Let  us  agree  to  credit  the  flrrt  player  with  fl  for  a  win 
and  —1  for  a  loss,  so  that  the  average  gain  per  play  to  the  first 
player  will  be 

(3)  E(a.b)  -  ab  >  (l-6)(l-b)  -  [a (1-b)  b(l-a)] 

and  the  average  gain  per  play  to  the  second  player  will  be  the 
nega 1 1 ve  of  this . 

The  first  player  will  then  choose  a  so  as  to  maximize  E(a,b), 
and  the  second  player  chooses  b  so  as  to  minimize.  The  least  he 
can  receive  is  then 

(4)  E  -  Min  Max  E(a,b). 

O^b^l  O^a^l 
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Let  U8  now  perform  some  elementary  calculations.  We  have 

(5)  E(a.b)  -  (?a-l)  v'b-)) 
so  that 

(6)  Man  E(a,b)  -  Max 
O^a^l 

Hence  the  minimum  over  b  of  this  exiresslon  occuie  where  l  1/2. 
Thus 

(7)  Min  Max  E(a,b)  -  0. 

O^b^l  O^a^l 

Now  consider  tne  situation  of  the  second  player.  He  proceeds 
to  choose  b  so  as  to  minimize  E(a,b),  yielding  the  fln.^t  [layer 
an  average  return  of 

(b)  Min  £(a,c}  -Min  (?a-l , l-ba ) . 

O^b^l 

It  follows  that  the  first  player  to  maximize  his  return  must  choo' 

a  -  1/2. 

From  these  calculations,  we  see  that  each  player,  If  ^.e  doer- 
not  know  what  tne  otner  person  Is  doing  Is  forced  to  protect 
himself  against  loss  by  using  the  equal— proLaPll’ ty  policy.  Thl.*’ 
guarantees  that  on  the  average  ne  will  win  as  often  as  he  loses. 

Observe  something  fascinating  about  this  rltuatlon.  fu}  pose 
that  the  first  plaver  announces  openly  that  he  1?  using  t^ll?  equal 
probability  policy.  Then  the  other-  player  cannot  Imj  rove  hlL^ 
average  return  even  with  this  additional  Informiatlon  at  his  dispo¬ 
sal  ,  and  vice  versa . 
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If  one  player  can  induce  the  other  player  to  depart  from  a 
safe  policy  by  using  over  a  number  of  plays  a  policy  wnlch  appears 
to  be  of  one  type,  but  which  is  actually  of  another,  then  he  can 
gain.  This  Is,  however,  a  risky  maneuver.  The  mathematical 
theory  of  these  more  complicated  processes  Is  part  of  the  study 
of  ’’learning  processes",  of  wnlch  one  aspect  Is  the  theory  of 
sequential  analysis  developed  by  Wald,  .  For  otr.er  aspects 
of  the  theory  of  learning  processes,  see  .Johnson  and  Karlin,  [12] , 
Bellman,  Harris  and  Snaplro,  [?] ,  Bellman,  ^Ij,  and  the  book 
by  Bush  and  Hosteller,  . 

Unsymroetrlc  Col n— Matching . 

Suppose  we  now  consider  a  coin  matchilng  game  of  the  following 
type.  If  two  heads  occur,  the  first  player  receives  units;  If 
two  tails,  he  receives  1  unit;  If  head— tails,  or  tail-heads,  he 
gives  the  other  player  2  units. 

Faced  wltn  an  Invitation  to  play  this  game,  the  first  player 
must  decide  whether  he  wants  to  play  or  not.  Let  us  see  the  type 
of  analysis  he  might  employ.  As  above,  we  make  the  assumption 
tnat  average  rtturn  Is  to  be  the  criterion  for  both  players.  Let 
a  be  the  probability  with  wnl  ti  the  first  player  displays  heads, 
and  b  the  nrobablllty  wltr,  wnlch  the  second  player  displays  heads. 
Then  the  expectel  return  to  the  first  player  Is 

(1)  Ii.(6,b)  *  5^b  (l— a)(l**b)  —  a  (1— b )— t  ( 1— a )  • 

Again,  as  above,  the  first  player  wishes  to  maxlTilze  this  function 
over  o’l  /aluep  of  a  In  j^econ  i  player  wishes  to 

minimize  ll  over  t  In  the  same  Interval.  Depending  upon  which 


•fijt  er  e  .Icr  has 
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point  of  view  one  takes,  we  see  that  we  have  to  ieterrnine  the 
two  quantities 

(2)  Min  Max  fc(a,b), 

0^b;^l  0^8  ^1 

and 

(3)  Max  Min  E(a,b). 

O^a^l  O^b^l 

Carrying  out  the  coTiputatlon ,  at  above,  we  see  that  the  t^o 
quantities  are  rather  sur;. r  1 1 l:igly  equal,  and  furthernore  tnat 
the  corrmon  value  Is  attained  far  the  following  probat  1 1 1 1 let ; 

(4)  a  -  V^.  b  -  V«- 

It  foil  w£  that  e^ch  player  can  announce  behavior  without  giving 
any  advantage  to  the  other. 

§4 .  Saddlepolnts . 

The  substance  of  the  atove  result  Is  tl.at  tne  funclloti  L(a,b) 
possesses  a  saddlepolnt  over  tue  square  define:  uy  0  <  a ,  h  <  1 . 
This  Is  to  say  that  there  Is  a  point  wltr.  tne  j  roj.crty 

that 

(1)  E(a' ,b')  <  t(a',b),  0  <  c  ^  1. 

E(a',b')  >  E(a,b’  ),  0  ^  a  <  i. 

In  more  complicated  games,  cptl-nal  ^-trategler  neen  not  be 
unique,  as  they  have  teen  In  these  two  caser  conrl  lereJ  atove. 
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§5.  A  General  Two— Choice  Game . 

Let  us  now  consider  s  f^eneral  two— choice  game,  where  the 
two  choices  may  be  thoughit  of  ac  neacis  or  tails.  Let  us  define 


(1)  »  the  return,  or  pay-off ,  to  the  first  player  If 

he  makes  the  1-th  choice  and  the  second  player 
makes  the  J-th  choice,  1,J,  -  1,P. 

The  matrix 

(?)  A  . 

\\n 

Is  called  the  payoff  matrix .  The  pay-off  matrix  for  the  second 
player-  will  be  the  negatl/e  of  this.  This  type  of  game  Is 
called  zero-sum ,  and  Ir  the  only  type  for  wnlch  there  Is  a 
satisfactory  theory  at  the  present  time. 

fit,  each  play  of  the  game,  let  the  first  player  make  the  first 
Choice  with  probability  x  and  the  second  choice  with  froba‘'IlIty 
X.-.  A  set  of  volue.*-^  (x^,  x,^)  ^Ith  x^  +  x^  ■  1,  x^,  x,  >  0,  la 
called  a  strategy .  similarly,  let  the  secorid  l  layer  make  nl.s 
first  choice  with  ;roLablllt-/  and  r.ls  secor,  i  cr.olce  with 
protatlllty  y,.  Tner.  t^'.e  expected  return  to  the  first  {layer  Is 


(3) 


r.(x  .y) 


a  1  1  1  y  1  *■  a  .  -j  y 


111-'  1 


L.' 


We  leave  It  as  a  simple  exercise  In  alp-.etir. ,  or  analytic 
geometry,  for  tne  reader  to  prove  that 


(M 


Max  Mir.  E(x,y)  ^  Mir.  Max  E(x,y), 
X  y  y  X 
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where  the  maxl-nuTi  Is  taken  over  tne  region  x,  «  1, 

*1*  yp  *  -^1'  yp  2  corr’ion  value  of  these 

two  expressions  Is  called  the  value  of  the  game,  and  occasionally 
denoted  by  v(A ) . 

As  above,  it  follows  that  the  flrrt  player  possesses  a 
strategy  wnlch  guarantee?  him  an  expectei  return  of  jt  lea’t  v, 
while  the  second  player  pos?e?sen  a  strategy  whlcr.  guarantees 
that  he  will  not  sustain  a  loss  of  more  than  v.  If  v  Is  negative, 
we  can.  If  we  wish.  Interchange  tne  term?  "return"  and  "lo??". 
These  strategies  need  not  be  unique. 

§^*  The  General  Finite  Came. 

Having  Introcjced  the  above  notation,  It  Is  now  ear>  tu 
continue  to  a  discussion  of  a  more  gei.eral  sltnatlor.  -%Mcn  eacf 
player  porseesee  a  finite  number  ol  cno*.  cer. 

Assume  that  th.e  first  player  may  make  one  of  M  choice?  ari.i 
that  the  second  player  may  make  aray  of  ci.olcer.  Let  I  .e  ,  vy-oi  r 
matrix  be,  as  above 

(U  A  .  (ajj), 

where  a.  ,  Is  tne  retjrr  to  the  first  ;  layer  If  r.e  -naket  tne  I  -tr. 

^  V' 

choice  and  tne  reconi  plsjer  makes  tbe  .  choice.  Tne  r.egatlve 
of  this  will  then  be  tne  retv.rn  to  th  cond  player. 

If  the  !'lT;'t  player  employs  a  strategy  x  »  (x^,x,  ,...,Xj^} 
and  the  second  player  employ.®  t.‘.e  .''trategy  y  »  (y^iy 
the  expecteu  return  to  the  first  player  will  be 

M,N 

(p)  h,(x,>)  »  2.  a.,x,y^. 

'  i,:-i  *  - 


The  funflamerital  result  In  the  tneor>  of  games  is 
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Theorem  (Mir. -Wax  Theorem  of  von  Neumann ) .  The  function  £^(x,y) 
popsesser  a  sadd le— point  over  the  region  defined  by 

(4)  a.  x^,  y,  >  0 

M  N 

b.  I  X  »  1,  Z  y  »  1. 

1-1  ^  1-1  ^ 

Hence 

(^)  Min  Max  E.(x,y)  -  Max  Min  F.(x,y)  »  v. 

y  X  ^  X  y 

Consequently,  the  first  player  possesses  at  least  one  strategy 
wTii cn  gua rantees  him  an  expecte .  return  of  least  v ,  regardless 
of  ^ia_t  the  second  i  layer  does  and  the  second  player  possesses 
1  lke><l se  at  least  one  strategy  wltlcr.  guarantees  that  ^1^  ex^^ected 
loss  is  not  more  than  v ,  regardless  of  what  the  first  player  doee . 

There  are  no  simple  proofs  of  tr.ls  result  altnough  there 
are  elementarv  ones.  The  sh.ortest  proofs  require  fixed-point 
t.’.eorems  borrowed  from  topology. 

Tnls  result  wur  establlsned  ty  Vori  Neumann  In  1 92^ »  In  the 
general  case,  while  tne  rartlcilar  cases  N  »  2,^,4  were  considered 
by  Borel.  Unf 01  t  jria te  1  j  ,  Eorel  at  first  believed  that  the  general 
cnse  ^as  not  true.  For  an  Ir'terestlng  discussion  of  que-^tlons 
ol  priority  see  tne  article  ty  Frechet  ,  j^llJ  and  the  retuttal  by 
Von  Neumar.n, 

§7.  Comnutat  1  ona  I  A  Igorl  tr.ms  . 


Tne  jeterml  fii- 1*  on  jf  tne  value  of  a  game  associated  with  a 
matrix  of  even  moderate  size,  say  ten— by-ten,  Is  not  an  easy  task. 
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No  explicit  analytic  representation  of  the  value  exists,  nor 
does  there  exist  any  analytic  representation  of  the  set  of  optimal 
strategies.  Furthermore,  even  If  these  analytic  representations 
did  exist  there  Is  no  guarantee  that  they  would  actually  be 
useful  for  coaputatlonai  purposes.  Consider,  as  an  Illustration 
of  this,  the  simpler  problem  of  solving  a  system  of  ter- by— ten 
linear  equations.  The  explicit  solution  by  means  of  Cramer's 
rule  can  be  used  effectively  Ir.  only  very  rare  c Ircums lances  to 
determine  the  numerical  solution. 

Consequently,  the  problem  of  computing  the  value  of  a  game 
resolves  itself  into  a  hunt  for  effective  numerical  algorithms. 

Cne  of  the  most  Important  algorithms  exploits  the  connection 
between  a  mult!— stage  game  and  the  original  game,  viewed  as  the 
steady  state  version  of  the  dynamic  game.  This  { rocedure  was  In¬ 
augurated  by  Bi'own  and  von  Neumann,  ^9],  and  Its  validity  was 
established  by  J.  Robinson  [l7j  .  A  continuous  version  of  thlis 
process  was  also  considered  oy  von  Keumanr',  [9jtCf.  also  Belltrian, 
[2*  for  a  generalization.  Thcie  ure  a  nur.cer  of  ot..er  teciinlquet 
based  upon  the  connection  between  tre  theory  of  games  arui  linear 
programming,  cf.  . 

In  any  particular  ca.se,  a  great  deal  can  usuallv  oe  done  by 
the  use  of  dominance  argunents,  which  greatly  simplify  the  search 
for  a  solution  by  eliminating  certain  feasible,  out  obviously 
Inefficient,  strategies  at  the  outset. 
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§b.  Contlnuoug  Csmeg . 

In  the  preceding  rectlons,  we  have  asaumed  that  each  player 
had  a  finite  number  of  cholcea.  Let  ue  now  conalder  a  more 
general  eltuatlon  where  each  player  hag  a  continuum  of  choicer. 

Suppose  that  each  player  Is  to  choose  a  number  In  the 
Interval  [0,l].  If  the  first  player  chooses  x  and  the  second 
player  chooses  y,  the  payoff  to  the  first  player  Is  determined 
by  the  function  K(x,y),  with  tne  negative  of  thla  the  return  to 
the  secona  player. 

In  order  to  mix  choices,  each  pl®y«**  chooses  a  distribution 
function,  dG(x)  for  the  first  player  and  dH(y)  for  the  second 
player.  The  expected  return  for  the  flrrt  player  Is  then  given 
by 

1  1 

(1)  y''  KiX,y)dClx)dH(y) 

o  o 

The  analogue  of  the  fundamental  result  of  von  .Neumann  Is  the 
result,  due  to  Vllle,  [2l] ,  that 

(2)  Max  Min  ^/^K{x,y)dC(x)dK(y)  -  Min  Max  /J/^K(x ,y )dC (x )dH(y ) 

C  H  o  o  H  C  0*^0 

provided  that  K(x,y)  Is  continuous  over  the  square  0  £  x,y  ^  1. 

One  of  the  reasons  for  considering  continuous  games  lies  in 
the  fact  that  the  solution  may  be  considerably  simpler  to  obtain 
than  In  the  discrete  case.  Het^,  the  continuous  case  is  to  be 
considered  as  an  approximation  to  the  discrete  case. 

An  interesting  study  of  games  of  timing,  arising  from  the 
study  of  duels.  Is  contained  in  Lhlffman,  ^19] •  A  discussion 
of  continuous  poker  games  Is  contained  In  the  references  cited 
in  §11. 

§9*  Nop— Jero  Sum  Carnes . 

Tne  analysis  of  the  foregoing  sections  applied  to  games 
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In  which  the  players  were  In  direct  opposition.  In  the  sense  that 
a  gain  for  one  player  meant  a  loss  for  the  other.  In  a  large 
number  of  applications  of  game  theory,  say  to  economic  situations 
or  to  military  situations,  this  Is  not  the  case. 

Let  us  consider  two  simple  ways  In  which  we  can  be  forced 
to  study  non— zero  sum  games.  Returning  to  a  discussion  of  the 
two— choice  game,  assume  that  the  payoff  matrix  for  the  first 
player  Is  as  before 


but  that  the  payoff  matrix  for  the  second  player  Is  not  the 
negative  of  this,  but  rather  the  matrix 


This  means  tf  at  the  players  measure  the  outcomes  of  dedelcns 
In  different  ways.  This  Is  the  usual  sltjatlon. 

The  first  player  as  before  wishes  to  play  to  as  to  maximize 
the  quantity  ab  4-  (l-a)(l-b)  -  (a{l— b))  -  b(l-a),  but  the  second 
player  wishes  to  play  so  as  to  maximize  the  quantity  — 2ab  —  2(1— a)(l-b) 
♦  a(l-b}  -f  b(l-a).  In  the  case  where  b  »  —A,  these  two  alms  were 
in  direct  opposition,  so  that  we  could  combine  these  two  types 
of  play  Into  one  min— max  cltuatlon.  This  we  can  no  longer  do. 

What  then  do  we  do?  The  answer  Is  t.hat  nobody  knows  for  sure. 
Tliere  are  a  number  of  tentative  proposals,  cf.  von  Neumann— Morgen- 
stern,  [2^ ,  Nash,  [l^ ,  which  yield  Interesting  and  Informative 
results  In  some  cases,  but  there  Is  no  uniform  satisfying  theory 
corresponding  to  the  zero-sum  case.  An  Interesting  discussion  is 
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contained  In  NcKlnaey,  . 

§10  Different  Criteria . 

We  encountered  difficulties  in  the  previous  section  because 
we  allowed  different  pay-off  matrices  for  the  players.  Suppose 
we  Insist  that  fi  *  —A,  but  now  assume  that  the  two  players  have 
different  theories  as  to  how  one  should  proceed  In  processes  of 
this  type. 

For  example,  one  player  may  be  perfectly  willing  to  rnaxlmize 
his  expected  return,  while  the  other  player  may  wish  to  maximize 
the  probability  that  ne  wins  a  certain  amount,  or  being  o  conser¬ 
vative  type,  may  wish  to  minimize  the  probability  that  he  loses 
more  than  a  certain  quantity. 

Using  these  different  criteria,  we  are  once  again  faced  with 
a  situation  in  whicn  the  two  players  are  not  in  direct  opposition, 
and  there  is  the  same  lack  of  a  definitive  theory  noted  in  the 
preceding  sectlo»'. 

§1.  .  N— person  Games  —  N  2  3* 

In  applications,  particularly  of  an  economic  nature,  we 
encounter  processes  in  which  there  are  more  than  two  players. 
Examples  of  this  are  furnished  by  bidding  on  industrial  contracts, 
and  by  a  number  of  games  of  social  nature  such  as  bridge  and  poker. 

Analysing  the  problem  in  a  purely  rational  manner,  it  turns 
out  that  tne  obvious  thing  to  do  in  some  cases  is  for  two  of  the 
players  to  form  a  coalition  against  the  third,  or  In  other  cases 
for  all  three  of  the  players  to  form  a  coalition  against  the 
consumer.  However,  the  rules  of  the  game  may  forbid  this  simple 
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solutlon.  How  then  to  play? 

Again,  there  are  a  numter  of  tentative  theoriee,  cf.  von 
Neumann-Morgenstern,  [2^,  Nash,  [l^ ,  and  S.iapiey,  ,  but  all 
have  a  number  of  drawbacks.  At  the  present  time  it  appears  as 
if  there  will  never  be  any  unitary  theory  of  N— person  games, 
but  only  a  number  of  theories,  each  satisfactory  within  Its 
domain,  but  Incapable  of  being  stretched  to  cover  the  entire 
region  of  Interest. 

§12.  Poker. 

Aa  soon  as  one  hears  the  term  "theory  of  games",  one  Is 
Intrigued  by  the  possibility  of  applying  this  theory  to  the  treat¬ 
ment  of  auch  pastimes  as  poker.  Prom  what  we  have  said  above. 

It  la  not  to  be  expected  that  much  can  be  done  in  connection 
with  actual  poker,  where  there  are  six,  seven  or  eight  players, 
each  with  quite  different  utility  functions.  However,  it  is 
Interesting  to  analyze  some  simple  two— person  games  In  the  hopes 
of  being  able  to  understand  some  aspects  of  such  characteristic 
features  as  bluffing,  and,  generally,  as  a  matter  of  intellectual 
curiosity. 

A  considerable  amount  of  material  on  coker  rnay  be  found  in 
the  book  by  von  Neumann— Morgens  tern,  f  ^  .  It  turns  out,  however, 
that  it  ia  considerably  simpler  tc  consider  some  continuous 
versions  of  poker,  cf.  Bellman— Blackwell ,  [6 j ,  and  bellman,  [3]* 

An  analysis  of  a  three-person  poker  game  using  the  equilibrium 
point  theory  of  Nash  may  be  found  in  Nash— Shapley ,  ,  where 

other  references  may  be  found. 
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Ae  far  as  applying  these  results  to  actual  play  is  concerned, 
let  us  state  the  general  rule  that  the  only  way  to  play  poker  Is 
to  play  according  to  the  opponents  and  not  according  to  some 
rigid  preconceived  theory. 

§1?.  Games  of  Survival . 

A.nother  very  Interesting  class  of  games  are  those  which  have 
been  given  the  name  "games  of  survival".  These  correspond  to 
the  classic  "gambler's  ruin"  In  which  two  players  sit  down  and 
play  until  one  or  the  other  of  the  players  has  all  the  money  In 
the  game.  What  distinguishes  this  type  of  game  from  those  con¬ 
sidered  stove.  Is  not  so  much  the  multi-stage  asi)ect,  but  the  feet 
that  there  Is  a  correlation  between  stages  due  to  the  fact  that 
the  choices  available  to  each  placer  at  each  stage  depends  upon 
the  amount  of  money  he  has  at  this  stage. 

For  a  discussion  of  multi— etage  games  and  games  of  survival, 
we  refer  the  reader  to  Bellman,  [4],  Mllnor  and  Shapley,  [l4]. 

The  functional  equation  technique  of  dynamic  programming,  ^5j * 

Is  useful  In  the  discussion  of  these  processes. 
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